Improve SegNet with feature pyramid for road scene parsing

نویسندگان

چکیده

Road scene parsing is a common task in semantic segmentation. Its images have characteristics of containing complex context and differing greatly among targets the same category from different scales. To address these problems, we propose segmentation model combined with edge detection. We extend network an encoder-decoder structure by adding feature pyramid module, namely Edge Feature Pyramid Network (EFPNet, for short). This module uses detection operators to get boundary information then combines multiscale features improve ability recognize small targets. EFPNet can make up shortcomings convolutional neural features, it helps produce smooth After extracting encoder decoder, Euclidean distance compare similarity between presentation which increase decoder’s restore encoder. evaluated proposed method on Cityscapes datasets. The experiment datasets demonstrates that accuracies are improved 7.5% 6.2% over popular SegNet ENet. And ablation validates effectiveness our method.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Video Scene Parsing with Predictive Feature Learning: — Supplementary Material —

In this supplementary material, we provide more implementation details including the architectures and training settings of baseline models and PEARL. We also present the experimental results and analysis of PEARL on Camvid dataset, as well as more qualitative evaluations of PEARL. 1. Implementation Details Since the class distribution is extremely unbalanced in video scene parsing, we increase...

متن کامل

Hierarchical Feature For Scene Parsing Using Fully Recurrent Network

In scene parsing, the wide-range contextual information is not effectively encoded. Scene parsing provides segmentation and determines an scene into different regions associated with semantic categories. The main objective of scene parsing is to reduce semantic gap between humans and computer machines on scene understanding. The scenes parsing applications are object detection, text detection o...

متن کامل

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Scene Segmentation

We present a novel and practical deep fully convolutional neural network architecture for semantic pixel-wise segmentation termed SegNet. This core trainable segmentation engine consists of an encoder network, a corresponding decoder network followed by a pixel-wise classification layer. The architecture of the encoder network is topologically identical to the 13 convolutional layers in the VGG...

متن کامل

Geometric Scene Parsing with Hierarchical LSTM

This paper addresses the problem of geometric scene parsing, i.e. simultaneously labeling geometric surfaces (e.g. sky, ground and vertical plane) and determining the interaction relations (e.g. layering, supporting, siding and affinity) between main regions. This problem is more challenging than the traditional semantic scene labeling, as recovering geometric structures necessarily requires th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: E3S web of conferences

سال: 2021

ISSN: ['2555-0403', '2267-1242']

DOI: https://doi.org/10.1051/e3sconf/202126003012